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Abstract 

Interference alignment is degree of freedom optimal in A'-user MIMO interference channels and 
many previous works have studied the transceiver designs. However, these works predominantly focus 
on networks with perfect channel state information at the transmitters and symmetrical interference 
topology. In this paper, we consider a limited feedback system with heterogeneous path loss and spatial 
correlations, and investigate how the dynamics of the interference topology can be exploited to improve 
the feedback efficiency. We propose a novel spatial codebook design, and perform dynamic quantization 
via bit allocations to adapt to the asymmetry of the interference topology. We bound the system throughput 
under the proposed dynamic scheme in terms of the transmit SNR, feedback bits and the interference 
topology parameters. It is shown that when the number of feedback bits scales with SNR as Cg • log SNR, 
the sum degrees of freedom of the network are preserved. Moreover, the value of scaling coefficient Cg 
can be significantly reduced in networks with asymmetric interference topology. 

I. Introduction 

A. Prior Works 

The capacity region for the interference channel remains unknown, although researchers have been 
working on it for more than thirty years ||T|, Q. Conventional schemes either treat interference as 
noise or use channel orthogonalization to avoid interference. However, these schemes are non-capacity 
achieving in general. Interference alignment (lA), which tries to align the aggregate interference from 
different transmitters (Txs) into a lower dimensional subspace at each receiver (Rx), is shown to be 
degree of freedom (DoF) optimal in interference channels ||T| as well as other network scenarios such 
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as the MIMO-X channels |j3|. In addition, despite the fact that lA is optimal only at high SNR, the I A 
method potentially gives simpler solutions because the direct channels are not needed to compute the 
precoders and decorrelators ||4|. As such, there is a surge in the research interests of lA. 

To implement lA, signal dimensions are needed and those dimensions can be obtained via symbol 
extension (time or frequency domain) or by multiple antennas (spatial domain) |[T|, Q. Existing I A 
deisgn involving symbol extensions has high signal dimension^ |j2| and is hard to implement in practice. 
As a result, many recent lA works have considered lA solutions in the spatial domain, i.e., without 
symbol extensions However, these approaches are all based on the assumption of perfect channel 

state information at the transmitters (CSIT), which is hard to obtain in practice. As such, we shall focus 
on studying the limited feedback design and the associated performance analysis of lA. 

The issue of limited feedback on MIMO networks is widely studied in the research community. For 
instance, for MIMO broadcast channels (BC) with zero-forcing beamforming, the performance loss due 
to limited feedback is studied in |[7|, ||8|. However, these works cannot be easily extended to MIMO 
interference channels with lA processing as the analysis highly depends on the BC topology and the 
zero-forcing strategy at Txs. There are some works that consider MIMO interference networks adopting 
LA under limited feedback. For instance, in |9|, lA with analog feedback is considered and the performance 



degradation is studied. In 1 10|, a new quantization scheme is studied to reduce the quantization distortion 
on MIMO interference networks. However, these works have considered homogeneous path loss and i.i.d 
fading and thus failed to exploit the potential benefits introduced by the asymmetric interference topology. 



Besides these works, there are also some papers 1 11 1, 1 12| that investigate the feedback bits scaling law 



on MIMO interference networks. The authors show that it is sufficient to maintain the maximum DoF 
feasible by lA when the number of CSI feedback bits at each Rx node scales on 0(log(SNR)). However, 
these works analyze the scaling law in the high feedback bits regime only and thus fail to quantize the 
network performance when we have finite feedback bits. Moreover, the potential possibility brought by 
heterogeneous path loss and spatial correlations to reduce the scaling bits are not explored. 

B. Remaining Challenges 

In this paper, we consider MIMO interference networks with heterogeneous path loss as well as spatial 
correlations, and focus on investigating the limited feedback performance of lA in spatial domain. In 



'The lA solution exploiting symbol extensions yj, j2j requires 0{(KN)^^ ^ ) (K denotes the number of Tx-Rx pair, A'^ 
the number of antennas at each node) dimension of signal space to achieve the optimal DoF, which is difficult to realize in 
practice. 
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view of the prior works, there are two key technical challenges that need to be addressed. 

• How to exploit heterogeneous path loss and spatial correlations to reduce the limited feedback? 

Traditionally, the CSI matrices are stacked into long vectors and then quantized by regular vector 



quantization (VQ) |13|, |14|. However, as these schemes adopt symmetric codebooks, they are 
inefficient when the channel matrices are spatially correlated fTSf , |16| or have heterogeneous 
path loss. Intuitively, when the links in the MIMO interference networks have spatial correlations, 
the normalized channel matrices will no longer be isotropic over the Grassmanian subspace p7| . 
Furthermore, if the links in the MIMO interference network have different path loss and spatial 
correlations, they should not be allocated the same amount of bits for limited feedback. The challenge 
is therefore how to exploit this asymmetry in the network topology to improve the efficiency of 
limited feedback. There are some works on point-to-point MIMO links that exploit the spatial 



correlations to improve limited feedback performance |18|, |19|. However, these works require 
closed-form precoders and hence, they cannot be extended to our problem with general MIMO 
interference network topology where there is no closed-form lA transceivers. In this paper, we 
propose a novel spatial codebook design to exploit the spatial correlations on MIMO interference 
networks. There are some works that consider dynamic bits allocations, such as the feedback bits 
partitioning between desired and interfering channels |20|, to improve the feedback efficiency. 
Motivated by this idea, we further perform dynamic quantization via bit allocations for different 
interference links to further exploit the asymmetry of the interference topology. 
• What is the trade-off between the feedback rate and the network throughput in general 
asymmetric MIMO interference networks? In literature, there are very limited works that analyze 
the performance loss due to limited feedback for lA on MIMO interference networks. In [21], the 
author gives some analysis on the trade-off between the throughput loss and limited feedback rate. 
However, the approach in this work relies on the closed-form lA solution for the precoder and hence, 
only the specialized topology and single stream transmission case is analyzed. Yet, the approach 
cannot be extended to our case because of the lack of closed-form LA solution for precoders. In this 
paper, we shall study the tradeoff between the network throughput and feedback rate based on the 
proposed scheme for more general interference topologies. From the analysis, we can obtain useful 
insights on how the system performance depends on parameters of the network topology such as 
the path loss and the spatial correlations. 
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C. Outline of the Paper 

This paper is organized as follows. In Section II, we give our system model of i^-user MIMO 
interference networks with heterogeneous path loss as well as spatial correlations, and specify the limited 
feedback topology. In Section III, we shall address the first technical challenge. We first illustrate the 
potential advantage of heterogeneous path loss and spatial correlations on the improvement of feedback 
efficiency using a toy example. Based on that, we then propose a novel spatial codebook design as 
well as dynamic quantization via bit allocations to adapt to the interference topology. In Section IV, we 
shall address the second challenge. We analyze network throughput under the proposed limited feedback 
scheme and give the throughput bounds. In Section V, we compare the performances of the proposed 
dynamic feedback scheme with several baselines via simulations. Through both analysis and simulations, 
we show that by exploiting the heterogeneous path loss and spatial correlations in the MIMO interference 
network, the proposed scheme significantly improves the system performance in a wide range of operation 
regimes. 

Notations: The following notations are used in the paper. Uppercase and lowercase boldface denote 
matrices and vectors respectively. The operators (•)*, (•)-^, i-)^ , || • ||, vec(-), IE{-}, Tr(-), rank(-), 0, are 
complex conjugate, transpose, conjugate transpose, Frobenius norm, stacking vectorization, expectation, 
trace, rank, Kronecker product operator respectively, diag(-) denotes forming matrix operator using the 
inputs as diagonal blocks, span({a}) denotes the linear space spanned by the vectors in {a}, log(-) is 
the logarithm of base 2, 0{-) denotes the asymptotic upper bound, i.e., /(x) = 0{g{x)) if there is a 
positive constant M such that < M ■ \g{x)\ for all sufficiently large x. 

II. System Model 

In this section, we shall first elaborate the interference network topology with heterogeneous path 
loss and spatial correlation. We further define the notion of interference topology profile and illustrate by 
using some examples. Finally, we will elaborate the limited feedback topology for the MIMO interference 
network with lA processing. 

A. Topology of the MIMO Interference Network 

We consider a K-user MIMO interference network in which each Tx is equipped with Nt antennas 
and each Rx with Nr antennas as shown in Fig. [T] (A). Denote the transmit SNR at each Tx as P, the 
large scale fading gain from Tx i to Rx j as Iji, the small scale fading matrix from Tx i to Rx j as 
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Hjj E C ''^ Let d be the number of data streams transmitted by each Tx-Rx pair. The received signal 
Yj G C^^^ at the Rx j is given by: 

K 

= ^^^Uf H,, V,x, + Uf ( J] /^^H.iV.Xi + z,), Vi G {1, 2, • • • K] (1) 

where Xj ~ CA/'(0, ^Irf) is the encoded information symbol at Tx i for corresponding Rx i, Vj G i^^^Ntxd. 
the transmit precoding matrix of Tx i, \]j G C^''^'^ the decorrelator of Rx j, and Zj G C^"^^^ the complex 
Gaussian noise with zero mean and unit variance. We have the following assumption regarding Hjj by 
using the Kronecker correlation model \22\ . 
Assumption 1 (Channel Fading Model): The channel matrix Hjj in this paper is given by: 

H,, = (*,^,,)V=H-($*,)V^ (2) 

wher^H^. G C^'^^^' and each entry of it is i.i.d. CAA(0, 1), G C^'-^^^ G C^'^^* are 
deterministic positive semi-definite (PSD) matrices which stand for the spatial correlation matrices at 
Rx, Tx side respectively, are normalized such that Tr($jJ = Nr, Tr($*j) = Nt. Denote 

MJ. = rank(*5.), Mj- = rank(** ■) (0 < Mj- < iV^, < Mj- < Nt), and the non-zero eigenvalues of 
**j as ■ ■ ■ Aji,A/;J, {o-jj,!, • • • <^ji,M}^ respectively. ■ 

B. Interference Topology Profile 

In this section, we define the notion of interference topology profile iZTV) which is used to capture 
the heterogeneous path loss and spatial correlations in the MIMO interference network. 

Definition 1 (Interference Topology Profile): We define the set of all the channel statistics ZTV = 
as the interference topology profile. ■ 

As such, the ZTV fully characterizes the heterogeneity of the path loss and spatial correlation among 
the interference links. We give several examples below with a {K = 4:, Nt = 3, Nr = 2, d = 1) 
interference network. 

• A fully connected MIMO interference network with i.i.d. Rayleigh fading: If = = I, 

Iji = 1, Vi, j G {1, • • - 4}, then the MIMO interference network reduces to the conventional fully 
connected interference channel in which all the elements of the channel matrices {Hj^} are i.i.d. 
Rayleigh fading. 

^We define the square root of a PSD matrix $ as #3 = FAsF^, where $ = FAF^ denotes the eigenvalue decomposition. 
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Figure 1. System model. 



• A fully connected MIMO interference network with asymmetric spatial correlation: Due to 

local scattering effects, the MIMO channel matrix Hjj may not be i.i.d. and in some cases, there 
will be spatial correlations. For instance, if = diag([ 2.8 0.1 0.1 ]), other = I, all 
= I, Iji = 1 in ITV, then this network corresponds to an example of a fully connected MIMO 
interference network with asymmetric spatial correlation. 

• A partially connected MIMO interference network with heterogeneous path loss: In practice, 
different cross-links might have heterogeneous path loss due to different geometric distributions 
between Txs and Rxs. For instance, if Z14 = 10"*^, other Iji = 1, all = = I, then this 
network corresponds to an example of a partially connected (since /14 <^ other Iji, the link from Tx 
4 to Rx 1 can be regarded as disconnected.) MIMO interference network with heterogeneous path 
loss. 



C. Limited Feedback Topology 

Assumption 2 (Network Connection and Information State): As illustrated in Fig. [T] (A), we assume 
that there is a BS controller Gc that has backhaul connections to all the Txs. We also assume that 
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the instantaneous CSI is available at the corresponding Rx side but not at the Tx side, and that the 
{^'ji, is available at both the Rx j and Tx j. ■ 

Remark 1 (Practical Considerations): In practice, {^^j, can be obtained at Tx by either 

reciprocity of path loss and spatial correlations or explicit feedback of them from the Rx side. In either 
case, the path loss and spatial correlations are slowly varying and the acquisition of them at the transmitters 
can be done with very small overhead compared with instantaneous CSI feedback. 

In this paper, we deploy the lA algorithm with iterative interference leakage minimization in Q to 
compute the transceivers. As all the CSIs are collected in Gc, we shall implement the lA algorithm in 
a centralized manner, such that there will be no over-the-air iterations among the nodes (over-the air 
iterations will consume excessive signaling overhead and backhaul bandwidth). On the other hand, we 
shall focus on the feedback scheme for the cross links only as the lA algorithm \4j is only related to the 
cross links. The outline of limited feedback topology is described in the following algorithm, and is also 
illustrated in Fig. [T](B). 

Algorithm 1 (Limited Feedback Topology for MIMO Interference Network Adopting lA Processing): 

• Step 1: At each Rx j, the cross link CSI {Hji, Vi 7^ j} are quantized to be {Hjj, Vi 7^ j} using 
the spatial codebooks {Cji, Vi 7^ j} with {Bji, \/i 7^ j} bits respectively. 

• Step 2: The quantized codeword indexes are then fedback to the j-th Tx using feedback link. 

• Step 3: Each Tx j receives the codebook indexes and reconstructs the CSIs to be {Hjj, \/i 7^ j}. 

• Step 4: The Txs forward the reconstructed CSIs to Gc through the backhaul link. 

• Step 5: Based on the collected quantized CSIs from all Txs, Gc computes the lA transceivers as 

{(Ui,Vi)} =IA({H„fc,Vn,A:, n 7^ fc}) (3) 

where lA denotes the lA processing Q and (Uj, Vj) denotes the designed lA transceiver. 
. Step 6: Gc distributes{(Ui, Vj)} to the Txs. 

• Step 7: Each Tx i forwards Uj to the Rx i using the forward control link. ■ 
The design of the spatial codebook and bit allocations {Cji, Bji i j} mentioned in Algorithm 1 

will be discussed in detail in Section III. Note that these designs are adaptive to the path loss and 
spatial correlations which are long term statistics. Hence, once the spatial codebooks and bit allocations 
are determined, each Rx will quantize the instantaneous CSIs independently using the corresponding 
codebooks and allocated bits. 

Assume that the network is lA feasible, we have the following properties about |(Uj, Vj)| |Q, 

Uf U. = Idxd, Vf Vi = I^xd, e {1, • • • K}. (4) 
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tjf H,-,Vi = Odxd, Vi + j, j G {1, • • • i^}. (5) 

Due to the limited feedback CSI, the lA transceiver cannot achieve perfect aUgnment and thus there 
will be some residual interference. Denote the residual interference to noise ratio (RINR) at Rx j as Ij, 
we have 

P ^ 

h = ^Y.h^\\^'^i^^i\?■ (6) 

III. Limited Feedback with Dynamic Quantization 
Define the dynamic feedback policy for the network as 

V = {Cji,B,,,ii^2,^i,2 = \---K}, (7) 

where Cji is the codebook for link Hjj, and Bji denotes the bits allocated for Cji. We investigate in this 
section how the limited feedback scheme V is designed to adapt to the XTV in the MIMO interference 
network. We first illustrate the motivation of dynamic quantization according to the XTV based on a toy 
example. We then elaborate the details of the proposed feedback scheme V, which is divided into two 
parts, namely the spatial codebook design in Section III-B and dynamic quantization via bit allocations 
in Section III-C. 



A. Motivation 

Consider a. K = 4, Nt = 3, Nr = 2, d = 1 interference network where ^\.^ = diag([ 2.8 0.1 0.1 ]), 
/i3 = 1, **4 = diag([ 1 1 1 ]), /i4 = 0.1 (all other = = I, Iji = 1), transmit SNR P = 1. 
Assume that all the other channel matrices are perfectly known by the BS controller Gc except for H13 
and H14. We shall then investigate below the feedback scheme for these two links only. As the fading gain 
from Tx 3 to Rx 1 is much larger (Z13 » /14), it is probable that better performance may be achieved if we 
concentrate on quantizing H13 only. Moreover, as H13 = llf^{{^\oJ^) = H^3-diag([ 2.8 0.1 0.1 ])°'^ 
according to ([2]) and thus the columns of H13 have different gains. It is probable that better performance 
can be achieved if the codebook to quantize H13 has the same statistic distribution as H13. To verify this 
hypothesis, we compare the performance of the following two quantization schemes. Note the dynamic 
quantization scheme illustrated below is only a simple toy scheme which helps to show the idea. In the 
following two schemes, Cq denotes a random vector quantization codebook |[7[ with (6x1) codewords. 

• Conventional VQ: Allocate equal bits to Hi3,Hi4 and use Co to quantize vec(Hi3), vec(Hi4) |[7|. 
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Sum feedback bits 


4 


10 


16 


Conventional VQ 


0.9057 


0.5826 


0.3219 


Dynamic Quantization 


0.3055 


0.1595 


0.1333 



Table I 

Residue interference comparison Versus bits 



• Dynamic Quantization: Use all the bits to quantize H13 only, the codebook to quantize vec(Hi3) 

[ diag([ 28 28 1 1 1 1 D^ '-f 
is given by C = < u | u = , f G Co 

[ l|diag([ 28 28 1 1 1 1 ])°-'-f|l 

The comparison of RINR at Rx 1 versus the sum feedback bits of the two links is illustrated in Table |l] 
We see that the Dynamic Quantization scheme can achieve much lower RINR than the Conventional VQ. 
This example demonstrates the potential benefit of dynamic quantization according to the interference 
topology profile. In the following, we shall elaborate the details of the proposed scheme that can adapt 
to the general TTV given in Def. 1. 

B. Spatial Codebook Design 

In this section, we shall propose a novel spatial codebook design to capture the asymmetric interference 
topology profile of MIMO interference channel defined in Def. 1 . From the toy example in the motivation 
part, we see that better system performance can be achieved by deploying spatial codebook given by 
transforming a base codebook with the corresponding spatial correlation matrices. Based on this intuition, 
we illustrate below how these spatial codebooks {Cji} are designed. 

Algorithm 2 (Spatial Codebook and Quantization Criterion): Each codebook Cji = {Wjj, • • • W^^'}, 
W'^ e d^A^^xTVt _ 2^3') is designed by transforming a base codebook (the base codebooks can 
be obtained by using the quantization cell approximation model in ||8j|) C^^ = {Sjj • • • S^^'}, where 
S^j G £NrxNt ^jjj, jjjg spatial correlation matrices i.e. 

With the input matrix H^j, the selected codeword Hjj in codebook Cji is given by 

Hji = arg max ||vec(Hji)^vec(Wy ||. (9) 
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Remark 2 (How the Spatial Codebooks adapts to the XTV): We transform a base codebook C^j with 
the spatial correlation matrices to obtain the spatial codebook Cji. Therefore, the spatial dis- 

tribution of the CSI Hjj matches the codewords in the spatial codebook Cji for all j, i, and thus less 
average quantization distortion will be induced. Furthermore, the quantization resolution of the spatial 
codebooks {Cjj} is also adaptive|^to the heterogeneity of XTV among the cross-links to further enhance 
the feedback efficiency. 

Remark 3 (Linear Complexity of Spatial Codebook Design): In the above design, the fixed base code- 
books {Cjj} are stored at both the Tx and Rx side. Whenever the spatial correlations change, the new 
codebooks can be found by transforming the base codebooks using the new spatial correlation matrices 
([S]). The overall complexity of the codebook design is thus 0{N), where is the number of codewords. 

Before we can analyze the RINR, we have to quantify the quantization distortion in terms of the bit 
allocation {Bji] as well as the XTV parameters. The relationship between the actual CSI H^j and the 
quantized CSI Hjj is given by: 

Hji = QjiHj, + AHji (10) 

where aji is some unknown complex scalar and vec(AHji) is the quantization distortion distributed 
in the orthogonal complement space of vec(Hjj). The following lemma gives an upper bound on the 
average quantization distortion. 

Lemma 1 (Average Quantization Distortion): Denote Dj^^ = E{||AHjj|p} as the average quantiza- 
tion distortion. Under high-resolution assumption (i.e., Bji is sufficiently large.), the average quantization 
distortion DjJ^ is upper bounded by 



where the distortion coefficient /3jj is given by 



mn 



(12) 

where kji^i = jjr-^j-zj, /cji,2 = '^mj^m^^-i ' ^ {1^ • • AiJJ, n E {1, • • • MjJ, and each of {ymn} is 
i.i.d. chi-square distributed with degree of freedom 2. 

Proof: Please See Appendix [B| ■ 

^This enables us to perform dynamic quantization among the cross-links, which is discussed in detail in Section III-C. 
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Remark 4 (Comparison with i.i.d. Case): When = = I, then H^j is i.i.d. complex Gaussian 
distributed. From ( 12 1, we get Pji = Nr-Nt — 1 and the distortion bound in (11) reduces to 



j^avg < ^upp ^ (^^^^ _ . 2-^^ = E{||H,,||2} • -^j^j;^ ' 2"^^ (13) 



which is consistent with the distortion value derived in |[8|, ||23j. 

Based on the above lemma about quantization distortions, we obtain the following theorem which 
describes an upper bound on the average RINR. 

Theorem 1 (Upper Bound of Average RINR): Denote Ij^^ = IE{/j} as the average RINR at Rx j 
under high-resolution assumption (i.e., Bji is sufficiently large for all i, i j), Ij^^ is upper bounded 

by 

where is given in Lemma [T] 

Proof: See Appendix [C] ■ 

C. Dynamic Quantization via Bit Allocations 

Based on the spatial codebook {Cji} designed in the previous section, we further perform dynamic 
quantization via bits allocations {B*^} in order to exploit the heterogeneity of path loss and spatial 
correlations among different links. Denote the sum feedback bits for all the cross links as B, we formulate 
the dynamic quantization as follows, which aims to minimize the sum of the average RINR upper bounds 
at all Rxs. 

Problem 1 (Dynamic Quantization via Bit Allocation): 

K 

jupp 

K 

^ B (15) 



mm 



S.t. 



where is given in Theorem 1. ■ 
Theorem 2 (Bit Allocation Solution): The optimal solution to Problem 1 is given by 

(16) 



(W-M^-- - I) ( loK ( 1 + b 



where h satisfies = 
Proof: Please see Appendix [Pj 
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Remark 5 (How the Dynamic Quantization adapts to the XTV): We shall use two examples to illus- 
trate how the dynamic bit allocation ([T6jl exploits the heterogeneity of the XTV. Consider the case when 



= *^ = for all i, 3 Then Mj- = M\ MJ. = M^ /3ji = /3 for all j, i, i / j, and 

+ 



thus we have B*- 



(iWM* — 1) ^log (^ (j\^r^y„i)2 ^ + ft^ according to (16 1. From this expression, 
we see that in this case, links with smaller path loss (larger value of Iji) will be allocated more bits; 
Consider the case that Iji = I, = I for all j, i, i j, and that B is large such that b will dominate 



log ( (M^m^-ir ) fo'" ^'^^ j ■ Thus we get B*^ « (M[,M*, - 1)6 = (iV.M*, - 1) 



rt 



From this expression, we see that in this case, links with smaller Mj-, which corresponding to larger 
transmit spatial correlatioi]^ will be allocated less feedback bits. With these adaptive allocations, we can 
achieve less aggregate distortion and thus achieve less residual interference after lA suppression. From 
these examples, we see that the proposed dynamic quantization exploits the heterogeneity of XTV and 
hence the feedback efficiency is enhanced. 

IV. Performance Analysis 

In this section, we analyze the network throughput of lA under limited feedback for the iiT-user MIMO 
interference networks. We first derive a network throughput lower bound (LB) for given average RINR 
at each Rx. Combining this result with the upper bound of the average RINR in Theorem [T] we obtain 
the network throughput LB under the proposed feedback design V = {Cji, B*-}, and express it in terms 
of the number of feedback bits {B*^}, the transmit SNR P and the XTV parameters. Finally, we show 
that when the number of feedback bits scales with SNR as Cs ■ log SNR, the sum degrees of freedom of 
the network are preserved. Moreover, the value of scaling coefficient Cs can be significantly reduced in 
networks with asymmetric interference topology. 

We shall first impose the following assumption on the statistics of the direct channels. 

Assumption 3 (Direct Channel Statistics): Assume that all the direct channels statistics are as follows: 
= = I, /jj = 1, Vj G {1, • • • K}, which corresponds to the i.i.d. Rayleigh fading model. ■ 

Note that the lA scheme |4| is only related with the cross links. Hence, we give a simple channel 
model for the direct links and focus on analyzing the limited feedback scheme for the cross links to obtain 
elegant insights. Consider a joint decoding strategy for the desired signal streams and denote {(Uj, Vj)} 
as the perfect CSIT lA transceiver. Then the network throughput under perfect CSIT can be expressed 

as g, 

"Smaller M|i (1 < M% < Nt) means that the channel matrix Kji has smaller number of transmit directions, which 
corresponds to larger transmit spatial correlations. 
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Rper = J]E jlogdet {l + :^(Uf H,-,V,)(Uf H,-,V,)^j | . (17) 

Following the above definition and treat residual interference as noise, we define the network throughput 
under limited feedback as 

= jlogdet 1^1+ ^(UfH,,V,)(UfH,,V,)^ + ^ Z,,(Uf H,,V,)(Uf H,,V,)« 

■ (f8) 



where {(Uj, Vj)} are the practical lA transceivers (|3jl designed with quantized CSI. 
Theorem 3 (Throughput under Perfect CSIT): Rp^r can be expressed as follows 

Rper = + I ■ ■ ^^^^ 

where f{v) is the marginal probability density function (p.d.f.) of the unordered eigenvalues of the {d x d) 
central Wishart matrix with d degrees of freedom and covariance matrix I (Wrf(I, d)) fTT\ (closed-form 



expression of f{v) can be found on page 32, |22|). 

Proof: See Appendix [E] ■ 

Due to the limited feedback, the network throughput Rum is always upper bounded by Rper, i-c, 
Riim. < Rper- In the foUowing, we derive a LB of Rum under the proposed feedback scheme. By 
decoupling the signal terms and interference terms, and by deriving the convex property of Rum with 
respect to (w.r.t.) the eigenvalues of the interference covariance matrix (Please refer to Appendix [F] for 
details), we obtain the following LB on Rum using Jensen's inequality. 

Lemma 2 (Throughput LB for Given Average RINR): Given average RINR IE{/j} at Rx j, the network 
throughput Riim in ([T8]) is lower bounded by 



Rum > J] d • log [l + ^E{/,} + ^-v] f{v)dv - J] d • log [l + Inij}) (20) 

where f{v) is given in Theorem^ 

Proof: Please See Appendix [F] ■ 
Remark 6 (Advantages of the LB in Lemma |2]).- Lemma |2] gives an approach to bound the network 
throughput in terms of the average RINR at each Rx. First, it is applicable to general d (d > 1) data 
stream cases. Note that many previous works on limited feedback system |[7|, |[8|, |21 1 focus on single 
stream case (d = 1) due to the mathematical difficulty to analyze matrix functions. However, we start 
with matrix analysis and overcome these difficulties to get a comparatively more general result. Second, 
the LB provided in Lemma [2] is tighter than the conventional analysis result in |21| (The LB in |21| is 
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equivalent to the right side of (20) by setting the first IE{/j} = 0.)- The numerical comparison of the two 
bounds is also illustrated in Section V. 

By combining Lemma |2] with the upper bound of the average RINR given in Theorem [T] we obtain 
the following throughput bound in terms of the number of feedback bits {B*.j), the transmit SNR P and 
the TTV parameters. 

Theorem 4 (Throughput LB under Proposed Feedback Scheme): Under the proposed dynamic feed- 
back scheme V = {Cjj, B*-^), Rum is lower bounded by 

Rum > Rlo^ = Ef=l d ■ /o^" log ^1 + P • E 

where f{v) is given in Theorem [3] Pji depends on ITV and is given in Lemma [T] 

Proof: By substituting the upper bound expression of lE{/j} in Theorem^into Lemma ^ we can 
get the desired expression. ■ 
Remark 7 (Interpretation of Theorem [?]).• For lA under limited feedback in general MIMO interference 
networks, the previous works |11 1, p2| focus on analyzing the feedback bits scaling law. Thus it gives 
the performance at extremely high feedback bits regime only and failed to quantize the performance 
when we have finite feedback bits. However, with the above result in Theorem [4j we can quantize the 
throughput in terms of the transmit SNR P, the number of feedback bits B and ITV parameters. For 
instance, consider a homogeneous i.i.d. fading case, i.e., = = I, Iji = 1 for all i, j. Then the 




above term ( [21] ) reduces to 

Rlim > Rlow = (^1 + ^^^^ • vj f{v) ■ = Rper (jT^^ 

B 

where oj = P{K - 1)2 K{K-i){NrNt-i) _ xhis indicates that Rum shall be no less than the throughput 
under perfect CSIT with a power degradation ratio of 

To obtain some simple insights on how the asymmetric TTV will affect our system performance under 
the proposed scheme, we give the following corollary. 

Corollary 1 (Feedback Bits Scaling with SNR): Denote pji = j^Ij^^' (0 < Pji < 1). When the 
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number of sum feedback bits B scales with log P a^ 

K 

B> Yl {l{i,.>o} ■ {NrNtpji -1)} -log P + Cb (22) 

where Cb is some bounded constant independent of P, we have that the sum DoFs of the network are 
preserved, i.e., 



lim 



Kd. 



(23) 



P-s>oo log P 

Proof: See Appendix [G] ■ 
Remark 8 (Interpretation of Corollary [7|.- We achieve a similar result with pT] |, p2| on limited feed- 
back analysis of lA on MIMO interference network, that the sum DoF achievable by lA can be maintained 
when the number of feedback bits scales on 0{\ogP) ( [22) ). Moreover, different from the i.i.d. channel 
fading model assumption in these works, we consider a general asymmetric interference topology and 
show how the asymmetric XTV can be exploited in the proposed dynamic feedback scheme to reduce 
feedback bits. From Corollary [T] we see that in networks when the spatial correlation matrices are not 
full rank, i.e., MJ^ = rank($^J < Nr, Mj- = rank($*-J < Nt (such that pji < 1) for some j, i, or in 
networks when the path loss is so large such that Iji = for some j, i, the scaling bits (22i to maintain 
the sum DoFs of the system could be reduced. 

V. Numerical Results 

In this section, we verify the performance gain of the proposed scheme through simulations. We shall 
first give the following random interference topology model for the cross links. Note the direct links are 
still assumed to have homogeneous path loss and i.i.d. fading (Assumption [3]l. 

Definition 2 (Random Interference Topology Model): Assume the dynamics of the cross link is con- 
tributed by both shadowing effect and transmit spatial correlation. The shadowing effect is modeled by 
log-normal shadowing, and the transmit spatial correlation is modeled using the Exponential Correlation 



Model described in pAj . Therefore, in the ITV, 

1 



I 



e 
1 

*\Nt-2 



Here the notation /{ij.>o} denotes the indicator function. In practice, we might never have exactly zero path gain. However, 
when the path loss Iji is so large such that the interference power is always below the noise floor within our SNR operation 
regime, we can treat Iji = and thus have /{;^..>o} = 0. 
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where {Iji} for different cross-links are assumed to be i.i.d., u is set to be —^S^ to normalize the mearj^ 
of large fading parameters Iji, Vj, i / j to be 1. ■ 
Under the above model, the dynamics of the interference topology can be expressed by two parameters, 
|e| and 6"^, which stand for the dynamics of the spatial correlation and the dynamics of shadowing effect 
respectively. Note that |e| =0 corresponds to no correlation and |e| = 1 corresponds to the strongest 
correlation; while larger 6^ corresponds to larger dynamics of the shadowing effect. In the following 
simulations, we compare the performance of the proposed dynamic feedback scheme (DFS) with the 
following baselines. 

• Conventional VQ (CVQ): Each cross-link is allocated equal feedback bits, and MIMO codebooks 
with symmetrically distributed codewords are deployed to quantize the vec(H). 

• Half Dynamic Scheme 1 (HDSl): Deploy spatial codebooks (Section III-B) but assign equal bits 
to all cross-links. 

• Half Dynamic Scheme 2 (HDS2): Deploy symmetric codebooks but dynamically allocate bits to 
the cross-links (Section III-C). 

• Random Beamforming (RB): Each Tx, Rx randomly choose a precoder and decorrelator. 

A. Performance Comparison w.rt. Amount of Feedback 

In Fig. [2| we consider & K = A, Nt = ?>, N^. = 2, d = 1 MIMO interference network. We vary 
the number of the feedback bits B and compare the network throughput of different schemes under 
the following parameter settings: interference topology dynamics (|e|, 5"^) = (0.7, 3), transmit SNR 
lOlog^o-f = 25dB. It shows that DFS can achieve higher throughput compared with the baselines, and 
larger performance gain over CVQ is achieved in relatively higher feedback bits regime. This shows 
that the proposed dynamic scheme can better adapt to the interference topology and thus achieves less 
performance degradation. On the other hand, we see that the proposed LB of DFS (derived in this paper) 
can better bound the DFS than conventional LB derived according to | |2T| , especially in low feedback 
bits regime. 

B. Throughput Comparison w.rt Transmit SNR 

In Fig. [3} we consider a K = A, Nt = 3, Nr = 2, d = 1 MIMO interference network. We vary the 
transmit SNR P and compare the throughput of different schemes under the following parameter settings: 

*The expectation of a log-normal distributed variable is E(lnA/'(u, 5^)) = e^'^^^ . 
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Throughput Comparison Versus Bits 




100 150 200 250 

Sum Feedbacl< Bits 



350 



Figure 2. Throughput comparison versus sum feedback bits under(|e| , 5^) = (0.7, 3) and 10 logjQ P — 25dB. 



interference topology profile dynamics (|e|, 6"^) = (0.7, 3), sum feedback bits B = 120 and B = 300. The 
reason that we have two B settings is to help illustrate how the throughput goes with SNR in different 
feedback bits regimes. It is shown that under both B settings, DFS can achieve a higher throughput than 
the baselines. Moreover, larger performance gain is achieved in the high SNR regime. 

In Fig |4j we consider a K = 4, Nt = 3, Nr = 2, d = 1 MIMO interference network. We vary the 
transmit SNR P, scale the sum feedback bits with SNR as B = K{K -l){NrNt-l) log P (see Corollary 
1) and show the throughput of different schemes under interference topology dynamics (|e|, 5"^) = (0.7, 3). 
From this figure, we can see that DFS achieves a larger throughput compared with the baselines, which 
demonstrates its performance advantages. Moreover, in the high SNR regime, we see that DFS, HDSl, 
HDS2 and CVQ have the same slope as the perfect CSIT throughput. Therefore, the sum DoFs of the 
network are maintained under this feedback bits scaling condition. 

C. Throughput Comparison w.r.t. Interference Topology 

In Fig. |5] and Fig. |6] we consider a K = A, Nt = Q, Nr = 4, d = 2 MIMO interference network. The 
reason that we change to d = 2 is to help verify that the proposed scheme is also applicable io d > 1 
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Throughput Comparison Versus SNR 

40 I , 1 , , 
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P(dB) 

Figure 3. Throughput comparison versus transmit SNR under (|e|,(5^) = (0.7,3) and B — 120, 300. 



Throughput Scaling with SNR 




5 10 15 20 25 30 



P(dB) 

Figure 4. Throughput scaling with transmit SNR under (|el,(5^) = (0.7,3) and B = K(K - l)(NrNt - 1) log P. 
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Throughput Comparison Versus Correlation Coefficient 
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Figure 5. Throughput comparison versus correlation coefficient |e| under 5^ = 3, lOlogj^QP = 25 dB and B = 828. 



schemes. 

In Fig. [Sj we vary the correlation coefficient |e| and compare the network throughput of different 
schemes under the following parameter settings: shadowing dynamics 5"^ = 3, transmit SNR 10 logio P = 
25dB and sum feedback bits B = 828. Note we choose a moderate number of feedback bits to illustrate 
and compare the performance in residual-interference limited region. From Fig. |5j we observe that as |e| 
goes higher, DFS and HDS 1 achieve larger performance gains over CVQ while HDS2 does not. This is 
because DFS and HDSl (but not HDS2) deploy the spatial codebook design (Section III-B). From this 
fact, we conclude that the spatial codebook design indeed captures the spatial correlations of the channel 
matrices and thus improves the feedback efficiency. On the other hand, by comparing HDSl and HDS2, 
we see that under the proposed random interference topology model, the spatial codebook design can 
contribute more to the performance gain in the relatively higher spatial correlation region. 

In Fig. [6j we vary the shadowing dynamics 5"^ and compare the network throughput of different schemes 
under the following parameter settings: correlation coefficient |e| = 0.7, transmit SNR lOlog^o P = 25dB 
and sum feedback bits B = 828. We see that as 5"^ goes higher, DFS and HDS2 achieve larger performance 
gains over CVQ while HDSl does not. This is because DFS and HDS2 (but not HDSl) deploy the 
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60 



Throughput Comparison Versus Shadowing Dynamics 
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Shadowing Dynamics 5 

Figure 6. Throughput comparison versus shadowing dynamics under |e| = 0.7, lOlogjQP = 25 dB and B = 828. 

dynamic quantization via bit allocations. From this fact, we have that the bit allocations indeed captures 
the shadowing dynamics and thus improves the feedback efficiency. On the other hand, by comparing 
HDS 1 and HDS2, we see that under the proposed random interference topology model, the bit allocations 
can contribute more to the performance gain in the relatively higher shadowing dynamics region. 



VI. Conclusions 

In this paper, we consider MIMO interference networks and investigate the performance of lA under 
limited feedback. We consider a general interference topology model which embraces various practical 
situations such as spatial correlations and path loss effects. A novel spatial codebook design with dynamic 
quantization is proposed to adapt to the path loss and spatial correlations. We analyze the performance 
bounds under the proposed dynamic feedback scheme, in terms of the transmit SNR, feedback bits and the 
interference topology parameters. Both analytical and simulation results show that the heterogeneity of 
path loss and spatial correlations can be exploited in the proposed scheme to enhance feedback efficiency. 
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Appendix 

A. Preliminaries of Codebook Design Heuristic: Transformed Codebook 



In 1 25 1-|27 1, the transformed codebook design is proposed to improve the limited feedback performance 
on correlated MISO channel. We now briefly illustrate the main ideas of these works. Denote the MISO 
channel representation as 

y = h^f -x + n (24) 

where the correlated channel state is modeled as h = Rg, g is i.i.d. complex Gaussian distributed with 
zero mean and unit variance and = RR^is the transmit correlation matrix known at the Tx side. The 
goal is to design an efficient beamforming codebook (note the best beamforming vector under perfect 
CSI should be f = ||^|) so as to reduce the performance loss induced by limited feedback. It is shown 
that the codebook given by: 

fi I fi = c, G Co) (25) 

||Rci|| J 

where Co is a base codebook with codewords symmetrically distributed in the Grassmannian subspace, 
can adapt to the channel correlation and achieve near optimal performance. 



In 1 27 1, 1 28 1, an upper bound of the performance loss by using this transformed codebook is derived 
via high-resolution asymptotic analysis. It is shown that the asymptotic distortion of a finite rate feedback 
system is given by 

L> = E{Dq(v,v)} = / / m(v;z;Ez(v)Xv,z)A(v)"^dvdz (26) 

Jq 

where Dq is the user defined distortion function, B the quantization bits, m(v; z; Ez(v)) the normaUzed 
inertial profile, p(v,z) is the probability density function at point v with side information z, and A(v) 



is the codeword point density. Please refer to works |27|, p8| for the specific details. 



B. Proof for Lemma [7] ( CSI Quantization Distortion ) 

The subscript is omitted for notation convenience in the following derivations. Denote = 

F''A'"(F'^')^, = F*A*(F*)^ as the eigenvalue decomposition, we have 

($ry/2 ^ F''(A^'')'/^(F^)-^, = F*(A*)'/^(F*)-^. (27) 

Define 

E = (F^)^HF*, E = (F^)^HF*, AE = (F'')^AHF*. (28) 



Therefore, we have E = aE + AE according to ( lOl. We shall prove the following two lemmas first 
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Lemma 3: vec(AE) is distributed in the orthogonal complement space of vec(E) and E{||AE|p} 
E{||AH||2}. 



Proof: From (10 1 and (28), we have: 

vec(AH)^ • vec(H) = vec(AE)^ • ((F*)* ® F'')^ • ((F*)* ® F'^) • vec(E) 

vec(AE)^ • vec(E) = 

where (go) comes from the fact that (F*)* (g) F'' is a unitary matrix. Therefore, vec(AE) is distributed 
in the orthogonal complement space of vec(E). The formula E{||AH|p} = E{||AE|p} directly comes 
from the unitary invariance property of Frobenius norm [29]. ■ 
Lemma 4: As quantization bits i? — oo, the asymptotic average distortion E{||AE|p} is upper 
bounded by 

E{||AE|p} < (3 ■ 2'^"'^''-^ 

where /3 is a constant that depends on = rank($''), M* = rank($''). 

Proof: Based on we have H = W'°, where Iq is given by: 

lo = arg max ||vec(H)^vec(W')| = arg max ||vec(F''E(F*)^)'f^vec(W^)|| 

1<«<2S 1<1<2^ 

= arg max ||vec(E)^ • vecffFn-^W^fF*))!! 

1<Z<2^ 



(hi) 



arg max llvecfE)^ • vec 

KK2B 



(A'-)V^(F^)^^S'F*(At)V2 

^ ^rr (A*«) A'')V=vec ((F^^S'F*) , 

arg max vec(E) --- — ; —, ttf — ; — rrrr 

^i<«<2«" ^ ' ||(A*® A'-)V2vec((F^')^S'F*)||' 



where in (hi), W' 
further obtain 



F^(An'AFn"S'FHAW(F^)"\\ to fl ^^^^^ °" r5|l, (|27|), we can 



'(A*)V2(Ft) 



vec(E) = vec((F'^)^HF*) = vec((F^)-^W'°F*) 
(A* A^)V2vec ((F'-)^S'«F*) 
||(A*0 A'-)V^vec((F'^)-f^S'«F*) ||' 

Therefore, vec(E) can be regarded as the selected codeword from the codebook with input vector 

vec(E), i.e.. 



vec(E) = arg max ||vec(E)^f; 



feci 



f I f 



P^vec ((F^)^S'F*] 
|P5vec((F'^)^S'F*) 



(29) 
(30) 



where P 
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Before trying to calculate the quantization distortion, we shall first eliminate the non effective di- 
mensions |j27l. As vec(E) = P2vec ((F'")^H'^F*) from (28 1 where A*" = diag([ Ai • • • \m- ]), 
A* = diag([ a ■■■ gm^ ]), we have that there are [N^Nt — PFM^) entries of vec(E) always 
being null. Besides, the corresponding entries in all codewords in codebook are also zero according 



to (30 1. 



Denote the non-zero support (non-zero index set) for vec(E) as S and the cardinality |S| = M^M^. 
Denote g(A) as the reduced vector formed by elements of g whose index lies in A. We get the following 
reduced quantization model (which includes the source channel model, the codebook, and the codeword 
selection criterion): 

f 

channel correlation model h = T^x^ 

transformed codebook C# = <! f, = T^.(vec((F'-)«s'F')(s)) , ^ 

■|T2.(vec((F-)«S'F')(S))r 



(31) 



h = argmaxf^gc* \ 



codeword selection 

where h = vec(E)(S), h = vec(E)(S), x„ = vec ((F')^H'"F*) (S) G C^^'^^^*^^ and 

T = P(S, S) = diag([ AicTi, A2cri ••• Am^cti, AiCT2 ••• ,Aa,/.o-m* ])• (32) 

As H*" is i.i.d. complex Gaussian distributed and F*", F* are unitary matrices, we have (F'')^H"'F* as 
well as Xm are i.i.d. complex Gaussian according to the bi-unitarily invariant property of Gaussian random 



matrix |22|. On the other hand, the codewords in {vec ((F^')^S'F*) | S' G C°} are also isotropically 



distributed |27|. Therefore, we can deploy the similar problem formulation described in |27J and use the 



high-resolution asymptotic analysis to calculate the quantization distortion. Define the distortion function 
to be 



Dq = ||vec(E)||2 . (1 - I (Ve, Ve) H = ||h||2 . (1 - I (v;,, V;,) , 

where Vg = jj^|^7gT||» = and Vg, v/i are the corresponding quantized vectors. 



(33) 



We get the inertial profile ?fit,._c(v,h) |27| is upper bounded by 



mtr-c(v, h) < 



It 



h||2(vfT-iv,: 



Tr((I-v,vf)T). 



where t = M^M (please refer to |27|, |28| for value of 7t). The codeword density A(v) is 



A(v) = 7r'.det(T)-i.(v^T-iv, 



By substituting ([34]), (|35]) into (|26]), we get 

, , det(T)V'- r(h^T-ih)(""/'-^)(Tr(T)||h||2-h^Th)l 

HDq) < 'J e|A ^ ||h||(-4-') j • '""^ 



(34) 



(35) 



(36) 
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Substitute the expression of T (32) into the above formula, we can get the expression of /? as shown 
in Lemma ^ For the details of the omitted derivation, please refer to |27|, |30|. 
We can express the relationship between the vec(E) and vec(E) as: 

vec(E) = ||E|| • I (ve, ve) |e^'^ • vec(E) + ||E|| • (l - | (ve, Ve) ■ vec(Z) 

where vec(Z) is a random unit-norm vector distributed in the orthogonal complement space of vec(E), 
(•) denotes the inner product operator of two vectors. By setting a = ||E|| • | (ve, Ve) \e^^, AE = 
||E|| • (l - I (ve, Ve) \^y^^ • Z, we obtain 



E{||AE||2} = E{||E||2(1 - I (ve, Ve) \')} = E{Dq}. 



Substitute the upper bound on E,{Dq} in (36 1 into (37 1, we get the lemma is proved. 
By combining Lemma ^and Lemma ^ Lemma ^is proved. 



(37) 



C. Proof for Theorem 1 (Upper Bound of average RINR) 



From (10 1 and (28 1 we have 



H 



a,,H,, + F^^,AE,,(F*,)^ 



Based on ([Hjl and the fact that U|^HjjVj = in (jsj), we have 



P 

~d 



K 



J]/,,||UfF,^,AE,,(F5,)^V, 



We shall prove the following lemma first. 
Lemma 5: We have the following property Vj, i ^ j, 



E{||Uf F,^,AE,,(F$,)^V,|p} < ^^^^^ _ ^ 



^7» j« 



2MV. M*. -1 



Proof: Denote G^^ = Uf FJ^ G C'^^^^ G^^ = (F*J^Vi E C^'^*^. We get that G^-^ has orthonormal 
rows and G^j has orthonormal columns. Denote Gjj(m) as the m-th row of G^j, Gjj(n) as the n-th 
column of GJj, 1 < m, n < d. We get 

E{||UfF;',AE,,(F*,)^V,||2} = J]E{||G^,(m)AE,,G,^,(n)||2} 

m,n 

= J]E{||vec (G^,(m)AE,,G,^,(n)) l^} 

m,n 

= ^e{\\ (G;,(nf 0G^,(m)) -veclAE.OlP} 

= {ll {G'^.inf ^ G^,(m)) (Sjj • vec (AE,,(S,,)) ||2}(38) 
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where Sji denotes the non-zero support for the column vector vec(Ejj) (as explained in Lemma |4] in 
Appendix |Bj), as the transpose of the support E>ji (note we use Sj- for G^j(n)^ (g) G^j(m) as it is a 
row vector). 

We shall then illustrate two facts. First, we have tjf F^.Eji(F*.)^Vi = according to and thus 
(G5,(n)^ ® G^,(m)) (Sp • vec (e,,,) (S,,) = [G^j^f ^ G^d^)) ■ vec (e,,) = 0. 

Therefore, ^^G^j(n)-^ (S" G^j(m)^ (^Ji)) ^^^^ '■'^^ i^ji-^ji ~ 1) dimensional orthogonal comple- 
ment space of vec (Ejjj (Sjj). Second, under large quantization bits, the codeword density (35 1 near 



vec(Ejj)(Sjj) can be approximated as uniform and thusvec(AEjj)(Sjj) is approximately isotropically 
distributed in {MJ^Mj- — 1) dimensional orthogonal complement space of vec(Ejj)(§jj). Based on these 
two facts, we have 



Ie{|| (G;,(nf »G^,(m)) (Sp • vec (AE,, )(§,■.)! p} 

Ie{|| (G;,(nf ®G^,(m)) (Sj,)||2||vec(AE,,)(S,,)||2} • E (beta(l,Mj,Mj, - 2)) 



< E{||vec(AE,-i)(Sii)in ^ 



MJ,M*, - 1 



J1 Ci MV.M^.-l 



. 2 (39) 



MV. M* . -1 



M^iM'j, - 1 

where (e) comes from the fact that ||G^j(n)^ G^j(m)|| = 1, beta(-) denotes beta distribution ||8 
By combining ( [38| ) and (39 1, Lemma^is proved. 
Based on Lemma^ we easily get: 

< 7 E /..K{||Uf F,^.AE,,(F5,)^V.||^} <Pdf: (j^,^^) 2 

D. Proof for Theorem 2 (Bits Allocation Solution) 

Formulate the Lagrangian with multiplier 7, and set the derivative w.r.t. Bji and 7 to zero 

^^•77l#^-2"^+7 = 0. (40) 



dL 



K 



E - ^ = 0- (41) 
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From (40 1, we get 



Bji 



{MJ,M^, - 1) ( log 



{M^^^Mt - 1)2 



+ h 



where h = log ^^^y^- Combine the above expression with (41 1 as well as the condition that Bji > 0, we 
get the desired solution. 



U, 



V • 



E. Proof for Theorem 3 (Throughput under Perfect CSIT ) 

Given any (Uj,Vj), we can construct unitary matrices Xjj - 
As {(Uj,Vj)} are independent of the direct channel states {Hj^}, we have that {(Uj, Vj)} are also 
independent of them. By combining this feature with the fact that {Hjj} are i.i.d. complex Gaussian 
distributed, we get Xj^Hjj'Vj is also i.i.d. complex Gaussian distributed according to the bi-unitarily 
invariant property of i.i.d. complex Gaussian matrix p2| . Therefore, 





Uf H,-.V^- = Uf (U,Uf )H,,(V,Vf )V, 



I 

dxd 



dxd 



(42) 



where Hjj denotes the left upper {d x d) sub matrix of (U|^HjjVj), and is thus i.i.d. complex Gaussian 



distributed. Therefore, Hjjtljj is a central Wishart matrix with degree of freedom d and covariance 
matrix Id . We have. 



R 



■per 



K 



logdet(I+^H,,Hj5 



Kd 



P 

log(l + -J • v)f{v)dv 
d 



where f{v) is the marginal probability density (p.d.f.) function of the unordered eigenvalues of the [d x d) 
central Wishart matrix with d degrees of freedom and covariance matrix 1^ W^(I(i, d)) [22] (closed-form 



expression of f{v) can be found in page 32, |22| ). 



F. Proof for Lemma 2 (Throughput LB for Given RINR) 

For any given (Uj, Vj), we can construct unitary matrices Uj 



V • 



As {(Uj, Vj)} are independent of the i.i.d. complex Gaussian matrix Hjj, we have U^^HjjVj is also 



i.i.d. complex Gaussian distributed |22|. Therefore, 



Uf H,,V^- = (U,Uf )H,,(V,Vf )V 



tH 



(UfH,,V,) 



Id 




Hence,Uj^HjjVj (the left upper {d x d) sub matrix of Uj^HjjVj) is i.i.d. complex Gaussian and is 
statistically independent of (Uj , Vj ) according to the bi-unitarily invariant property of i.i.d. complex 
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Gaussian matrix |22|. On the other hand, as Hjj is independent of {Hjj, i ^ j}, we get that Uj^HjjVj 
and Uj^HjjVj (for all i j ) are conditionally independent given {(Uj, Vj)}. Combine this feature 
with the fact that Uj^HjjVj is statistically independent of {(Uj,Vj)}, we have that JJj^HjjYj is 
independent of Uj^HjjVj for all i ^ j. Hence, we get that the desired signal and interference signal are 
decoupled. Denote 

P ^ 

^ E lji(Ufilj,Vi)(UfUj,Vif = nj^jRf (43) 

as the eigenvalue decomposition, where Hj is a unitary matrix and Sj is the diagonal matrix with 
real positive eigenvalues. We have that the i.i.d. complex Gaussian matrix JJj^HjjYj is independent 
of the unitary matrix Rj. Therefore, we have Hj = Hj^JJj^Hjj'Vj is also i.i.d. complex Gaussian 
distributed statistically independent of R|^, and thus is also independent of Therefore, we can first 



take expectation w.r.t. "Ej, and then w.r.t. for Rum in (18 1, i.e., 

Ru^ = E^{H]} jlogdet (^I + ^ (H,^(Hp^) (I + ^^O"') }} • (44) 

To help prove the theorem, we shall first prove the following lemma. 

Lemma 6: The function ^(X) = logdet (l + AX^^) is convex w.r.t. X, where A € C'^^'^ is a constant 
Hermitian positive definite (PD) matrix and X is defined on D^^ = |diag([ xi ■ ■ ■ ]) I a^i > 0, Vi|. 

Proof: g(X^) = logdet (X + A) — logdet (X). According to |31|, the second order differential of 
<7(X) is 

d^giX) = -1- • dvec(X)'^ • ^x,x5(X) • dvec(X) 
in 2 

where ?^x,x5(X) = -((X + A)^)'^ (g) (X + A)-^ + (X^)"^ X-^. Since X + A ^ X and both 

X + A,X ^0 (here A ^ B means that A-B is PD), then (X + A)-^ ^ X-\ ((X + A)^)-i ^ (X^)"\ 

and it is easy to verify that ^x,x5(X) >z [29]. Therefore, g(X) is convex w.r.t. X. ■ 

With the convexity property in Lemma ^ and using the Jensen's Inequality on (44i, we have 

K f / p 

RHm> E ^ { log det f I + - (H^^(Hp^) (I + E {Ej})-' 
j=i 

and Tr (E {S,}) = E {Tr(I],)} = E {Tr(f . /.^(Uf H,-,Vi)(Uf H.-.V,)^)} = E(/,). 

Denote as a permutation matrix with dimension d, and the set of all permutation matrices with 
dimension d as P^. Since Hj(Hp^ is a central Wishart matrix, we have that 
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Rh^ > f]E|logdet(^I + :^(H^«(Hp^)(I + E{I],})-i)| 

= J] E jlogdet (l + ^ (H,'(Hp^) (I + P. • E {S,} • P.)-^) } 



for any P^ G P^. Further using Jensen's inequality, we get 



Rum > f;E|logdet||l+^(H](Hp^) (^I+i ^ P,.E{S,}.PJ^ j| 



K 



where in (r), H^(Hp^ is a central Wishart matrix with d degrees of freedom and covariance matrix 1^ 
(Wd{ld, d)) ), f{v) is given in Theorem |3] 

G. Proof for Corollary 1 (Scaling Law with Transmit SNR) 



With (21 1 and by Jensen's inequality, we can further get 



K 



Rlim — 



3=1 



+ 00 



P 



K 



K 



P 



K 



i=i 



> R„er - -fCd • log IH V 



2MV. Alt . -1 



As limp_ 



,oo = Kd, we get that the sum DoFs of the system are kept if 



E 



P 



K ^ \ MJMi - 1 



2A/r.M* . -1 



<Co, 



(45) 



where Co is some bounded constant that does not depend on P. 



As P — oo, S — oo, we have 6 — oo in ( 16 1. Therefore, we have that when Lj / 0, B*, 



It 



{M^iM^ji - 1)6 + Cji; when Iji = 0, 5*. = in (16 1. Substitute these {B*-} into (45 1, we obtain 
b > logP + Ci- Hence, the sum feedback bits is 

K K 

B= Bj,= E (MJ,Mj,-l)6 + a> 5] {/|,^,>o}-(A^riVtP,^-l)}logP + Cfe, 
where Cji, Ci, C;, above are some bounded constants independent of P. 
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